## Modularity Optimizer version 1.3.0 by Ludo Waltman and Nees Jan van Eck
##
## Number of nodes: 5707
## Number of edges: 222297
##
## Running Louvain algorithm...
## Maximum modularity in 10 random starts: 0.8954
## Number of communities: 7
## Elapsed time: 0 seconds
We are losing AV zonation again, but it does seem that Integrating by Sequencing Depth is the way to go. I will try with another Transform, the same that was used in Zarkada et al., 2021 and many other scRNASeq datasets, SCTransform
Let’s see if I can recover AV zonation in some way
## Modularity Optimizer version 1.3.0 by Ludo Waltman and Nees Jan van Eck
##
## Number of nodes: 5707
## Number of edges: 218164
##
## Running Louvain algorithm...
## Maximum modularity in 10 random starts: 0.8718
## Number of communities: 7
## Elapsed time: 0 seconds
Here it looks like AV zonation is not well separated again, there are some C6a cells that cluster together with C6v. I will increase the dimensionality of the dataset (since I am at 6 dimensions I should not go below this, specially since the plots make no sense from 5 and below dimensions) and see where that goes
I will go for 17 dimensions now, since I did a quick rescanning and decided to go for that plot, as it is the one I see gives the “clearest” behavior in the dataset
## Modularity Optimizer version 1.3.0 by Ludo Waltman and Nees Jan van Eck
##
## Number of nodes: 5707
## Number of edges: 210128
##
## Running Louvain algorithm...
## Maximum modularity in 10 random starts: 0.9072
## Number of communities: 8
## Elapsed time: 0 seconds
If I take 17 dimensions I can see a visually clearer plot when Integrating the samples. The automatic clustering still gives me a good biological information. Using Carlos’ already stablished clustering (NewClustering) I get that:
Mine Carlos
C7 -> C6a
C6 -> C6v
C5 -> C4i&C4ip
C4 -> C3 (Tip Cells)
C3 -> C2
C2 -> C0a
C1 -> C1+C5
C0 -> C0v
Integration in terms of Sequencing Depth makes the dataset work much better altogether, however, the spatial AV zonation we are used to seeing gets lost and this time I do not know how can I recover it.
If I try going for a higher number of dimensions I get a plot that is very similar to what I got with Irepan when he was here in October. The clustering can be set up so it mimics quite well what Carlos had. One thing to note though, is that C6a gets split into Arteries and Arteries Mutant, where the mutant arteries develop a venous phenotype, that makes them go spatially to that region while still belonging to the arterial cluster (C7)
If this clustering makes sense maybe we should consider using this new UMAP, as it reflects better the biological behavior by discarding batch effects. Alternatively, if we want to go back to a plot that is more similar to what Carlos got, I will need to work longer with this and maybe ask for help.
## R version 4.0.3 (2020-10-10)
## Platform: x86_64-w64-mingw32/x64 (64-bit)
## Running under: Windows 10 x64 (build 19043)
##
## Matrix products: default
##
## locale:
## [1] LC_COLLATE=Spanish_Spain.1252 LC_CTYPE=Spanish_Spain.1252 LC_MONETARY=Spanish_Spain.1252 LC_NUMERIC=C LC_TIME=Spanish_Spain.1252
##
## attached base packages:
## [1] stats graphics grDevices utils datasets methods base
##
## other attached packages:
## [1] patchwork_1.1.1 yaml_2.2.1 rmarkdown_2.11 dplyr_1.0.7 ggplot2_3.3.5 SeuratObject_4.0.4 Seurat_4.0.5 knitr_1.36 BiocStyle_2.18.1
##
## loaded via a namespace (and not attached):
## [1] Rtsne_0.15 colorspace_2.0-2 deldir_1.0-6 ellipsis_0.3.2 ggridges_0.5.3 spatstat.data_2.1-0 farver_2.1.0 leiden_0.3.9 listenv_0.8.0 ggrepel_0.9.1 RSpectra_0.16-0 fansi_0.5.0 codetools_0.2-18 splines_4.0.3 polyclip_1.10-0 jsonlite_1.7.2 ica_1.0-2 cluster_2.1.2 png_0.1-7 uwot_0.1.11 shiny_1.7.1 sctransform_0.3.2 spatstat.sparse_2.0-0 BiocManager_1.30.16 compiler_4.0.3 httr_1.4.2 Matrix_1.3-4 fastmap_1.1.0 lazyeval_0.2.2 later_1.3.0 htmltools_0.5.2 tools_4.0.3 igraph_1.2.9 gtable_0.3.0 glue_1.5.1 RANN_2.6.1 reshape2_1.4.4 Rcpp_1.0.7 scattermore_0.7 jquerylib_0.1.4 vctrs_0.3.8 nlme_3.1-153 lmtest_0.9-39 xfun_0.26 stringr_1.4.0 globals_0.14.0 mime_0.12 miniUI_0.1.1.1 lifecycle_1.0.1 irlba_2.3.3 goftest_1.2-3 future_1.23.0 MASS_7.3-54 zoo_1.8-9 scales_1.1.1 spatstat.core_2.3-2 promises_1.2.0.1 spatstat.utils_2.2-0 parallel_4.0.3 RColorBrewer_1.1-2 reticulate_1.22 pbapply_1.5-0 gridExtra_2.3 sass_0.4.0 rpart_4.1-15 stringi_1.7.6 highr_0.9 rlang_0.4.12 pkgconfig_2.0.3 matrixStats_0.61.0 evaluate_0.14 lattice_0.20-45 ROCR_1.0-11 purrr_0.3.4 tensor_1.5 labeling_0.4.2 htmlwidgets_1.5.4 cowplot_1.1.1 tidyselect_1.1.1 parallelly_1.29.0 RcppAnnoy_0.0.19 plyr_1.8.6 magrittr_2.0.1 bookdown_0.24 R6_2.5.1 generics_0.1.1 withr_2.4.3 pillar_1.6.4 mgcv_1.8-38 fitdistrplus_1.1-6 survival_3.2-13 abind_1.4-5 tibble_3.1.6 future.apply_1.8.1 crayon_1.4.2 KernSmooth_2.23-20 utf8_1.2.2 spatstat.geom_2.3-0 plotly_4.10.0 grid_4.0.3 data.table_1.14.2 digest_0.6.29 xtable_1.8-4 tidyr_1.1.4 httpuv_1.6.3 munsell_0.5.0 viridisLite_0.4.0 bslib_0.3.1